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Structure of MERS-CoV spike receptor-binding domain 
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The spike glycoprotein (S) of recently identified Middle East respiratory syndrome coronavirus (MERS-CoV) tar- 
gets the cellular receptor, dipeptidyl peptidase 4 (DPP4). Sequence comparison and modeling analysis have revealed 
a putative receptor-binding domain (RBD) on the viral spike, which mediates this interaction. We report the 3.0 A- 
resolution crystal structure of MERS-CoV RBD bound to the extracellular domain of human DPP4. Our results 
show that MERS-CoV RBD consists of a core and a receptor-binding subdomain. The receptor-binding subdomain 
interacts with DPP4 p-propeller but not its intrinsic hydrolase domain. MERS-CoV RBD and related SARS-CoV 
RBD share a high degree of structural similarity in their core subdomains, but are notably divergent in the receptor- 
binding subdomain. Mutagenesis studies have identified several key residues in the receptor-binding subdomain 
that are critical for viral binding to DPP4 and entry into the target cell. The atomic details at the interface between 
MERS-CoV RBD and DPP4 provide structural understanding of the virus and receptor interaction, which can guide 
development of therapeutics and vaccines against MERS-CoV infection. 
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Introduction 

A novel coronavirus, recently named the Middle East 
respiratory syndrome coronavirus (MERS-CoV), was 
first identified in humans in the Middle East in 2012, 
and later in several European countries [1-3]. A sizable 
fraction (> 50%) of the infected patients developed se- 
vere respiratory illness and clinical symptoms similar to 
those seen during outbreak of severe acute respiratory 
symptom (SARS) caused by the coronavirus SARS- 
CoV in 2003 [4]. In particular, preliminary epidemiol- 
ogy studies suggest human-to-human transmission of 
this deadly virus, leading to global concern about the 
potential for a MERS pandemic [5, 6]. Genetic and phy- 
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logenetic characterization has shown that MERS-CoV 
belongs to lineage C of the genus of betacoronavirus 
and is closely related to Tylonycteris bat coronavirus 
HKU4 and Pipistrellus bat coronavirus HKU5, although 
the direct source and reservoirs of MERS-CoV remain 
uncertain [2, 7, 8]. Like other coronaviruses, the MERS- 
CoV virion utilizes a large surface spike (S) glycoprotein 
for interaction with and entry into the target cell [9, 10]. 
The S glycoprotein consists of a globular SI domain at 
the N-terminal region, followed by membrane-proximal 
S2 domain, a transmembrane domain and an intracellular 
domain [11-13]. Determinants of cellular tropism and 
interaction with the target cell are within the S 1 domain, 
while mediators of membrane fusion have been identified 
within the S2 domain [11, 12]. Through co-purification 
with the MERS-CoV SI domain, Raj and colleagues [9] 
recently identified that dipeptidyl peptidase 4 (DPP4, 
also called CD26) functions as a cellular receptor for 
MERS-CoV. DPP4 does not share any sequence or 
structural similarities to the previously identified human 
coronavirus receptors, such as angiotensin-converting 
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enzyme 2 (ACE2) for SARS-CoV [14, 15] and HCoV- 
NL63 [16], or aminopeptidase N (APN) for HCoV-229E 
[17]. Like ACE2 and APN, however, DDP4 is expressed 
on the surface of several cell types, including those found 
in human airways, and possesses ectopeptidase activity, 
although this enzymatic function does not appear to be 
essential for viral entry [18]. Sequence and modeling 
analysis of S glycoproteins from several human corona- 
viruses has revealed a potential receptor-binding domain 
(RBD) of MERS-CoV [10, 19]. However, given the rela- 
tively low degree of homology between S glycoprotein 
sequences and mechanisms of interaction with distinct 
cell surface receptors, it is likely that there is a significant 
variability in structural features among respective RBD 
receptor pairs. Here, we report the crystal structure of 
MERS-CoV RBD in complex with human DPP4 extra- 
cellular domain at a resolution of 3.0 A. 

Results 

Overall structure of the complex 

MERS-CoV RBD (E367 to Y606) and soluble DPP4 
extracellular domain (S39 to P766) with C-terminal his- 
tidine tag were expressed in Sf9 cells and purified by Ni- 
NTA and size-exclusion chromatography (Supplementary 
infonnation, Figure SI). The two components were then 
mixed, and the complex was purified by size-exclusion 
chromatography (Supplementary information, Figure 



SI). Crystals belong to the P6,22 space group with cell 
dimensions, a = b = 1 10.6 A, c = 527.6 A, a = P = 90°, j 
= 120°. The structure was determined by the molecular 
replacement method and refined to a resolution of 3.0 A 
with R wgrk of 19.2% and R free of 24.1% (Supplementary 
information, Table SI). There is one complex of DPP4 
extracellular domain with MERS-CoV RBD in the asym- 
metric unit. The final model consists of residues E382 to 
C585 of the MERS-CoV RBD, S39 to P766 of the DPP4 
extracellular domain, and glycans N-linked to residues 
N85, N92, N150, N219, N229, N281, N321 and N520 of 
the DPP4 extracellular domain. 

The DPP4 extracellular domain consists of an N- 
terminal eight-bladed (3-propeller domain (S39 to D496) 
and a C-terminal a/(3 hydrolase domain (N497 to P766) 
(Figure 1). The (3-propeller domain consists of eight 
blades, each made up of four antiparallel (3-strands (Sup- 
plementary information, Figure S2). DPP4 utilizes the 
blades 4 and 5 to contact the MERS-CoV RBD (Supple- 
mentary information, Figure S2). The contact site is far 
from the hydrolase domain (Figure 1), which is consis- 
tent with the previous findings showing that the addition 
of DPP4 inhibitors sitagliptin, vildagliptin, saxagliptin or 
P32/98 does not block MERS-CoV entry [9]. 

Structure of MERS-CoV RBD and its comparison with 
SARS-CoVRBD 

The MERS-CoV RBD contains a core subdomain 
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Figure 1 Overall structure of the complex. DPP4 extracellular domain consists of N-terminal eight-bladed p-propeller domain 
(green) and C-terminal a/p-hydrolase domain (orange). MERS-CoV RBD contains a core (cyan) and a receptor-binding sub- 
domain (purple). The disulfide bonds are drawn as yellow sticks and the N-linked glycans are drawn as pink sticks. 
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Figure 2 Structural comparison between MERS-CoV RBD and SARS-CoV RBD. Domain structures of MERS-CoV S1 (A) 
and of SARS-CoV S1 (B). (C) Structure of MERS-CoV RBD. The receptor-binding subdomain is colored in purple and the 
core subdomain is colored in cyan. (D) Structure of SARS-CoV RBD (PDB code 2AJF). The receptor-binding subdomain 
is colored in purple and the core subdomain is colored in wheat. (E) Schematic illustration of MERS-CoV RBD topology, p 
strands are drawn as arrows and a helices are drawn as cylinders. The disulfide bonds are drawn as yellow sticks. (F) Sche- 
matic illustration of SARS-CoV RBD topology, p strands are drawn as arrows and a helices are drawn as cylinders. The disul- 
fide bonds are drawn as yellow sticks. 



and a receptor-binding subdomain (Figure 1). The core 
subdomain is a five-stranded antiparallel P sheet (pi, P2, 
P3, P4 and P9) with two short a helices in the connecting 
loops (Figure 2C and 2E). Three disulfide bonds, con- 
necting C383 to C407, C425 to C478 and C437 to C585, 
reside in the core subdomain to maintain the fold (Figure 
2E). The receptor-binding subdomain is a four-stranded 
antiparallel P sheet (P5, P6, P7 and P8), located between 
strands P4 and P9 of the core domain (Figure 2C and 
2E). There is a long loop connecting P6 and P7 strands, 
which crosses perpendicularly to the P sheet (Figure 2C 
and 2E). The disulfide bond between C503 and C526 
connects this loop with strand P5 (Figure 2E), thereby 
providing structural support for the P sheet to contact 
DPP4. Similarly, SARS-CoV RBD also contains the core 



and receptor-binding subdomains (Figure 2D and 2F) 
[14]. Although MERS-CoV and SARS-CoV have low 
amino acid sequence homology in the RBD (Supple- 
mentary information, Figure S3), their core subdomains 
are structurally similar with an r.m.s.d. of 2.0 A for 95 
aligned Ca atoms. However, clear differences exist in 
the receptor-binding subdomain between MERS-CoV 
and SARS-CoV The former contains 84 residues (V484 
to L567, Figure 2A), forming a four-stranded antiparal- 
lel P sheet (Figure 2C and 2E), whereas the latter has 68 
amino acids (T425 to Q492, Figure 2B), forming a long 
extended loop with two short antiparallel P strands and 
one disulfide bond between C467 and C474 (Figure 2D 
and 2F). 



Cell Research | Vol 23 No 8 | August 2013 



Nianshuang Wang et al. @ 




Figure 3 Binding interface. (A) DPP4 contacts the MERS-CoV RBD with its blades 4 and 5 in the N-terminal eight-bladed 
p-propeller domain. Patch 1 is centered around the C-terminal end of the long linker connecting p6 and p7 strands in MERS- 
CoV RBD. Patch 2 has a gently concaved outer surface in MERS-CoV RBD that contacts a linker containing a short a helix 
between blades 4 and 5 of DPP4. Amino acid interactions in patch 1 (B), and in patch 2 (C). 



Binding interface 

The MERS-CoV RBD binds to the DPP4 |3-propeller 
domain with a buried surface of ~2 550 A 2 . At the inter- 
face, a total of 14 residues of MERS-CoV contact with 
15 residues of the DPP4 with a distance cutoff of 3.6 A 
(Supplementary information, Table S2). The interface 
consists of two major binding patches. In patch 1 (~1 250 
A 2 buried surface), the C-terminal end of the long loop 
connecting the (36 and (37 strands contacts the blade 4 of 
DPP4 (Figure 3A). MERS-CoV acidic residues E536, 
D537 and D539 at the C-terminal end of the loop form 
a negatively charged surface, and D539 has salt-bridge 
interaction with DPP4 basic residue K267 (Figure 3B). 
MERS-CoV residue Y499 fonns hydrogen bond with the 
DPP4 residue R336 in this patch (Figure 3B). In patch 2 



(~1 300 A 2 buried surface), there is a slightly concaved 
outer surface at the far end of the MERS-CoV receptor- 
binding subdomain, formed by the short (36 strand, C- 
terminal parts of (35 and (37 strands, N-terminal part of (38 
strand and the (35-(36 linking loop (Figure 3A). This con- 
caved outer surface accommodates a linker containing a 
short a helix between blade 4 and blade 5 of DPP4 (Figure 
3A). In this patch, a hydrophobic core is found, con- 
sisting of MERS-CoV RBD residues L506, W553 and 
V555, and DPP4 residues L294 and 1295, surrounded by 
hydrophilic MERS-CoV RBD residues D510, E513 and 
Y540, and DPP4 residues H298, R317 and Q344 (Figure 
3C). Among the peripheral hydrophilic residues, MERS- 
CoV RBD residues D5 1 0 and E5 1 3 form salt-bridge and 
hydrogen-bonding interactions with DPP4 residues R3 1 7 
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and Q344, respectively (Figure 3C). 

MERS-Co V critical residues for DPP4 binding and viral 
entry 

To study the biological relevance of these residues at 
the interface between MERS-CoV RBD and DPP4, we 
generated a series of mutant MERS-CoV RBD proteins 
and characterized their impact on binding activity to 
DPP4 and the entry efficiency of pseudotyped viruses 



bearing the mutant S glycoprotein. As shown in Figure 
4, mutations at several residues, either singly or in com- 
bination, resulted in a significant reduction in binding to 
DPP4 and lowered the efficiency of viral entry. In patch 
1, for instance, a single-residue substitution of Y499A, 
expected to eliminate hydrogen bonding between Y499 
and R336, significantly interrupted the binding between 
MERS-CoV RBD and DPP4, as well as hindered vi- 
ral entry (Figure 4). Similarly, combined substitutions 



B 



4? 



kDa 



Afi J S< <jx <ox <?x <oX 

^ <y <y <y <x 



^ J* <*> <£> 4? 
4? 4? 4? 4? 4? 



4? 4? <f 4^ 4T 

<jr <y <y <y <y 



-DPP4 



RBD 




ISO 



■£ 100 



O 50 




/ 



<p <r 



4? 



4? 



4P 
4? 



4 



Figure 4 Effect of residue substitution on MERS-CoV RBD binding to DPP4 (A) and entry efficiency of pseudotyped viruses 
(B). (A) SDS-PAGE analysis of co-purified complexes of wild-type or mutant forms of His-tagged RBD and untagged DPP4. 
The actual residue changes in the RBD are indicated above each lane. The DPP4 untagged serves as a negative control to 
exclude nonspecific binding of untagged DPP4 with Ni-NTA resin. (B) Entry efficiency of pseudotyped viruses bearing the 
wild-type and mutant forms of viral spike glycoprotein. The percentage of entry efficiency was calculated on the basis of lu- 
ciferase activity of mutant viruses versus that of the wild-type virus. Soluble RBD (-150 ng/ml) and DPP4 (-150 ng/ml) were 
also tested for their inhibitory activity against wild-type virus. One irrelevant soluble protein with the same concentration was 
used as a negative control. Error bars represent SD of two replicate experiments. 
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involving E536R, D537K and D539K, hypothesized 
to disrupt the native interaction with K267, resulted in 
profound reduction in the binding and viral entry levels 
(Figure 4). Furthermore, in patch 2, a single-residue sub- 
stitution of either L506A or W553A, expected to alter 
the hydrophobic core (Figure 3C), significantly reduced 
both binding and viral entry efficiency (Figure 4). In par- 
ticular, combined residue substitutions involving L506A, 
W553A and V555A resulted in even more profound 
reduction in viral entry efficiency (Figure 4), suggesting 
that the hydrophobic core formed between MERS-CoV 
RBD and DPP4 plays a critical role in mediating viral 
binding and entry into the target cells. Finally, a single- 
residue substitution of D5 1 OA that disrupts its salt-bridge 
interaction with DPP4 R317, or a change of E513A that 
disrupts its hydrogen-bonding interaction with DPP4 
Q344, also hindered, to variable degrees, both binding 
and viral entry; while the impact of other mutations such 
as R5 1 1 A was not significant (Figure 4). 

Discussion 

In summary, we report the crystal structure of MERS- 
CoV RBD in complex with the DPP4 extracellular 
domain and have identified several key residues in the 
S glycoprotein critical for viral binding and entry into 
target cells. In general, MERS-CoV RBD consists of two 
major subdomains, the core and receptor-binding sub- 
domain. While the former shares a high degree of struc- 
tural similarity to SARS-CoV RBD, the latter is notably 
divergent from known structures (Figure 2). This struc- 
tural distinction helps to explain receptor specificities of 
MERS-CoV and SARS-CoV [9, 15]. The high degree of 
structural similarity in the core subdomain, however, also 
suggests that there is a selection pressure for structural 
conservation or convergence in this region, despite pro- 
found sequence diversity. 

In terms of receptor binding, MERS-CoV RBD spe- 
cifically binds to blades 4 and 5 (Supplementary infor- 
mation, Figure S2) but not other blades in the P-propeller 
domain of DPP4, which could be explained by charge 
and shape complementarities at the binding interface 
(Supplementary information, Figure S4). Specifically, the 
P-propeller domain of DPP4 has three positively charged 
residues at the outer surface of blades 4 and 5 (K267, 
R336 and R317), which interact with the negatively 
charged residues on the surface of RBD (D510, E536, 
D537 and D539) (Supplementary information, Figure 
S4). Surrounded by these charge-charge interactions, 
the short a helix between blades 4 and 5 of DPP4 docks 
into the hydrophobic concaved surface of RBD (Supple- 
mentary information, Figure S4). These structural fea- 



tures suitable for RBD binding are only observed in the 
outer surface of blades 4 and 5 of DPP4. Furthermore, 
it is also clear that the enzymatic site of DPP4 is distant 
from MERS-CoV RBD-binding site (Figure 1), similar 
to the structure of ACE2 binding with SARS-CoV RBD 
[14]. This provides a structural explanation for previous 
findings showing that DPP4 enzymatic inhibitors do not 
block the viral entry of MERS-CoV [9]. However, the 
overall structural features of DPP4 are clearly distinct 
from those of ACE2. Such structural differences, in ad- 
dition to their potential differences in expression levels 
and tissue distribution, is expected to play a critical role 
in determining cell tropism as well as pathogenesis of 
MERS-CoV and SARS-CoV in vivo. Of note, small 
degree of sequence variation has been identified in the 
contact residues among DPP4 from different mammals, 
which may help to determine the cell susceptibility as 
well as the host range of MERS-CoV (Supplementary 
information, Figure S5). Nevertheless, our report of 
structural features of MERS-CoV RBD in complex with 
the DPP4 has provided atomic understanding of virus 
and receptor interaction, and will guide development of 
therapeutics and vaccines against MERS-CoV infection. 

Materials and Methods 

Expression and purification of human DPP4 

Human DPP4 extracellular domain (residues 39-766) with an 
N-terminal Hemolin signal peptide for secretion and a C-terminal 
6* His tag for purification was inserted into pFastBac-Dual vec- 
tor (Invitrogen). The construct was transformed into bacterial 
DHlOBac component cells, and the extracted bacmid was then 
transfected into Sf9 cells using Cellfectin II Reagent (Invitrogen). 
The low-titer viruses were harvested and then amplified to gener- 
ate high-titer virus stock, which was used to infect 2 L Sf9 cells at 
a density of 2 x 10 cells/ml. The supernatant of cell culture con- 
taining the secreted DPP4 was harvested 72 h after infection, con- 
centrated and buffer-exchanged to HBS (10 mM HEPES, pH 7.2, 
150 mM NaCl). DPP4 was captured by nickel (Ni)-charged resin 
(GE Healthcare) and eluted with 500 mM imidazole in Tris buf- 
fer (50 mM Tris, pH 8.8, 300 mM NaCl). DPP4 was then purified 
by gel filtration chromatography using the Superdex 200 column 
(GE Healthcare) pre-equilibrated with Tris buffer (25 mM Tris, pH 
8.8, 30 mM NaCl). Fractions containing DPP4 were collected and 
applied directly to a pre-equilibrated RESOURCE Q column (GE 
Healthcare) and then eluted with a 30-1 000 mM NaCl gradient 
in 25 mM Tris buffer (pH 8.8). Fractions containing DPP4 were 
finally purified with gel filtration chromatography using Superdex 
200 column. 

Identification and preparation of MERS-CoV RBD 

A series of constructs encoding different fragments of MERS- 
CoV SI protein were designed based on secondary structure 
prediction. Each fragment was cloned into the vector pFastBac- 
Dual (Invitrogen) with an N-terminal gp67 signal peptide and C- 
terminal His tag. All constructs were transformed into bacterial 
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DHlOBac component cells to obtain recombinant bacmid and then 
transfected into Sf9 insect cells. Viruses were harvested after 7 
days, amplified and tested for protein expression. One of the frag- 
ments (MERS-CoV RBD, residues 367-606) showed the highest 
level of protein expression and was able to be co-purified with 
untagged DPP4 using Ni-charged resin. This fragment was further 
produced in large quantity and purified using Ni-charged resin and 
HiLoad Superdex 75 Column (GE healthcare). 

Crystallization and data collection 

Purified RBD and DPP4 were mixed for 1 h and then loaded 
to Superdex 200 column. The complex was collected and concen- 
trated to -10 mg/ml in HBS buffer (10 mM HEPES, pH 7.2, 150 
mM NaCl) for crystallization. Crystals were successfully grown at 
room temperature using the sitting drop vapor diffusion method by 
mixing equal volumes of protein and reservoir solution containing 
25% (w/v) polyethylene glycol 1500. Crystals were frozen in liq- 
uid nitrogen with cryoprotectant (well solution plus 20% (v/v) gly- 
col) before data collection. Diffraction data were collected at the 
BL17U beam line of the Shanghai Synchrotron Research Facility 
(SSRF). Diffraction data were indexed, integrated and scaled with 
the program HKL2000 [20]. 

Structural determination and refinement 

The structure was determined by the molecular replacement 
method with PHASER [21] in CCP4 suite [22]. The search models 
are DPP4 extracellular domain structure (PDB code 2G63) and 
SARS-CoV RBD structure deleting the receptor-binding loop 
(PDB code 2AJF). Density map improvement by atoms update 
and refinement was performed with ARP/wARP [23], and auto- 
matic model extension of the missing receptor-binding subdomain 
of MERS-CoV was carried out with BUCCANNER [24], At the 
final stages of model building and refinement, glycans were added 
based on the electron densities. Structure validation was performed 
with PROCHECK [25]. All structural figures were made with Py- 
Mol [26]. 

Mutagenesis and Ni-NTA pull-down experiments 

All MERS-CoV RBD mutants were generated by site-specific 
PCR mutagenesis and were confirmed by sequencing. Baculovi- 
ruses expressing these mutants with C-terminal His tag were pre- 
pared in the same way as wild-type MERS-CoV RBD. Sf9 cells 
were coinfected by baculovirus expressing untagged DPP4 and 
a MERS-CoV RBD mutant baculovirus. The supernatant of 2 ml 
cell culture in six-well plates was harvested 72 h after infection 
and buffer-exchanged to HBS (10 mM HEPES, pH 7.2, 150 mM 
NaCl). Protein was captured by adding 40 (0.1 Ni-charged resin (GE 
Healthcare) and incubated at 4 °C for 3 h. After an extensive wash 
with Tris buffer (50 mM Tris, pH 8.8, 300 mM NaCl) containing 
20 mM imidazole, the bound protein was eluted with 40 ill Tris 
buffer (50 mM Tris, pH 8.8, 300 mM NaCl) containing 500 mM 
imidazole. Twenty microlitres of eluted sample was mixed with 5 
u.1 of 5x SDS-PAGE loading buffer and boiled at 100 °C for 5 min, 
and then loaded on 12% SDS-PAGE gel. The gel was stained with 
coomassie blue. 

Pseudotyped virus and viral entry 

Pseudotyped virus was generated by co-transfection of human 
immunodeficiency virus backbone expressing firefly luciferase 



(pNL43R-E-luciferase) and MERS-CoV spike glycoprotein ex- 
pression vector (pcDNA3.1+; Invitrogen) into the 293T cells [27]. 
Viral supernatants were harvested 48 h later, normalized by p24 
ELISA kit (Beijing Quantobio Biotechnology Co., Ltd., China) 
before infecting the target cells Huh7. The infected Huh7 cells 
were lysed at 48 h after infection and viral entry efficiency was 
quantified by comparing the luciferase activity between pseudo- 
typed viruses bearing the mutant and wild-type MERS-CoV spike 
glycoprotein. The inhibitory effect of soluble RBD and DPP4 was 
analyzed by incubating with pseudotyped virus bearing the wild- 
type MERS-CoV spike glycoprotein before the infection. 

PDB deposition 

The coordinates and diffraction data have been deposited into 
the Protein Data Bank with accession code 4L72. 
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